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<^ ! ABSTRACT 

We report R ^ 4300 Very Large Telescope SINFONI adaptive optics integral field K-band 

<^ . spectroscopy of the nucleus of the Seyfert 1 galaxy NGC 3227 at a spatial resolution of 0.085" 

l/^ I (7pc). We present the morphologies and kinematics of emission lines and absorption features, 

^N ■ and give the first derivation of a black hole mass in a Seyfert 1 nucleus from spatially resolved 

_j. , stellar dynamics. We show that the gas in the nucleus has a mean column density of order 10^^- 

f^ ' lO^^cm"^ and that it is geometrically thick, in agreement with the standard 'molecular torus' 

>0 ■ scenario. We discuss which heating processes may be responsible for maintaining the vertical 

^""■^ height of the torus. We have also resolved the nuclear stellar distribution, and find that within a 



:^ 



Oh' 



X 



few parsecs of the AGN there has been an intense starburst. The most recent episode of which 
began ^^40 Myr ago but has now ceased. The current luminosity of stars within 30 pc of the 

O I AGN, '--^3 X 10^ Lq, is comparable to that of the AGN. Based on a comparison of the respective 

size scales, we argue that the star formation has been occuring in the obscuring torus. Finally, 

Cj ■ we present the first derivation of a black hole mass in a Seyfert 1 nucleus from stellar dynamics 

which marginally spatially resolve the black hole's sphere of influence. We apply Schwarzschild 
orbit superposition models to our full 2-dimensional data and derive the mass of the black hole, 
paying careful attention to the input parameters which are often uncertain: the contribution of 

C^ I the large scale bulge and its mass-to-light ratio; the recent star formation in the nucleus and its 

mass-to-light ratio; the contribution of the gas mass to the potential; and the inclination. Our 
models yield a Icr range for the black hole mass of Mbh = 7 x 10^-2 x 10^ Mq. 

Subject headings: galaxies: active — galaxies: individual: NGC3227 — galaxies: nuclei — 
galaxies: Seyfert — galaxies: starburst — infrared: galaxies 



1. Introduction 

Three issues concerning active galactic nuclei (AGN) which are currently receiving considerable atten- 
tion, and which we address in this paper, are: the relationship of the AGN to the surrounding star formation, 
the central black hole mass, and the physical properties of the obscuring torus. 



^Based on observations at the European Southern Observatory VLT (074.B-9012). 



The issue of star formation in AGN has long been contentious, ahhough it is now clear that on 0.1- 
1 kpc scales star formation is an important process in all types of AGN (Cid Fernandes 2004). In particular 
empirical population synthesis of Seyfert 2 nuclei (Cid Fernandes et al. 2001; Gonzalez Delgado, et al. 2001; 
Storchi-Bergmann et al. 2001) reveals evidence for recent (<lGyr) star formation within a few hundred 
parsecs of the nuclei in about 40% of Seyfert 2 nuclei. While some authors (e.g. Joguet et al. 2001; Gu et 
al. 2001) reach similar conclusions, not all concur (Ivanov et al. 2000). Mid- and Far- infrared ISO data 
(Verma et al. 2005) also have much to offer on this issue. The situation regarding Seyfert 1 nuclei is rather 
more uncertain, and a criticism (equally applicable to Seyfert 2s) often levelled is that it lies so far from the 
AGN that it cannot be associated with or strongly influencing the characteristics of the AGN (Heckman et 
al. 1997). Nevertheless, Davies et al. (2004a, b) report two cases where powerful starbursts do exist within 
the central few tens of parsecs of type 1 AGN. 

Beyond this debate is the fundamental question concerning the physics linking the two phenomena 
through a causal relationship (Shlosman & Begelman 1989; Goodman & Tan 2004; Thompson et al. 2005) - 
which is related to, although more complex than, driving gas to small scales. While observation and theory 
provide strong indications linking gas inflow and circumnuclear starbursts to AGN and nuclear starbursts, 
the statistical evidence is tenous (Knapen 2004; Wada 2004; Shlosman 2003). For example, HST imaging 
shows no dependence of AGN activity on the presence of a nuclear spiral (Martini et al. 2003). Similarly, 
AGN are equally common in galaxies with and without a compact nuclear gas component (Garcia-Burillo 
et al. 2003). 

A parallel theme is the molecular obscuring torus (Antonucci & Miller 1985; Antonucci 1993), the heart 
of the unification scheme. Many models of the thermally re-radiated spectrum of the putative torus have 
been constructed, both as a uniform ring (Pier & Krolik 1992a, 1993; Granato & Danese 1994; Schartmann 
et al. 2005), and also as a clumpy medium (Nenkova et al. 2002) as was proposed for NGC 1068 a decade 
earlier by Cameron et al. (1993). In all cases, the crucial properties are a spatial scale of tens (to perhaps 
hundreds) of parsecs, a vertically extended geometry, and a high column density. In the last few years, 
models have begun to consider star formation within the torus (Wada & Norman 2002; Thompson et al. 
2005), giving credence - albeit with a different perspective - to some early ideas about stellar light in AGN 
spectra. 

The mass of the central supermassive black hole Mbh is the third issue we address. The masses of 
black holes in nearby AGN are most commonly estimated by reverberation mapping (Onken et al. 2004), a 
technique that can be extended to higher redshift via additional scaling relations (Kaspi et al. 2000, 2005; 
Vestergaard 2002, 2004). Providing an independent measure of Mbh for reverberation masses would allow 
one to begin to understand the geometry of the broad line region, which is the limiting factor in the accuracy 
of the method (Home et al. 2004). 

Estimates of such masses are important with respect to the relation between the Mbh in the center 
of a stellar spheroid and the velocity dispersion a^ (Ferrarese & Merritt 2000; Gebhardt et al. 2000). This 
relation has superseded similar ones using the luminosity or mass of the spheroid (Kormendy & Riehstone 
1995; Magorrian et al. 1998) to become a cornerstone in the cosmological context of galaxy evolution and 
black hole growth. It is generally accepted that the Mbh — cr* relation should be valid for all spheroids 
irrespective of whether the black hole is quiescent or active, whether the spheroid in which it lies is a 
globular cluster or a giant elliptical galaxy, whether the spheroid is embedded in a gaseous disk or not, and 
so on. However, almost without exception the black hole masses that are considered 'reliable' - typically 
those based on stellar kinematics and for which the radius of influence of the black hole has been resolved - 
have been derived only for nearby bulge dominated E/SO galaxies (Tremaine et al. 2002; Marconi & Hunt 



2003; Ferrarese & Ford 2005). The smaller bulges of spiral galaxies imply lower Mbh, making it difRcult to 
spatially resolve the stellar kinematics. And for AGN, where the black hole is by definition active rather 
than quiescent, the glare of the AGN itself is an added obstruction. To date there is only one such published 
mass, ^ 2 X 10^ Mq for Gen A (Silge et al. 2005). Despite this galaxy's proximity, seeing limited observations 
are only able to resolve the radius of influence of the black hole because the black hole is at least 5 times 
more massive than expected from the Mbh — f* relation. Given this result, it is crucial that black hole 
masses are derived using stellar kinematics at high spatial resolution for more AGN. 

In this paper we look in detail at these topics for the Seyfert 1 nucleus of NGG 3227. We probe the star 
formation and molecular gas in the central 80 pc at a spatial resolution of only a few parsecs. In particular 
we discuss how the young stars and gas relate to our understanding of the canonical torus. In addition we 
derive the mass of the black hole from Schwarzschild modelling of the stellar dynamics. This is the first 
time that this has been possible for a Seyfert 1 galaxy, where using integral field capability we measure the 
kinematics across the full 2-D field. We pay careful attention to issues which are often rather uncertain 
in Schwarzschild orbit superposition models: the inclination of the system, the mass-to-light ratio, and the 
contribution of gas to the gravitational potential. 

The distance to NGG 3227 of 17Mpc (for Hq = 70-75 kms^^ Mpc^^) is estimated from the 1250 kms^^ 
luminosity weighted average velocity of the 13 galaxies in the group of which it is a member (Garcia 1993). Its 
infrared (8-1000/im) luminosity is then log Lm/ Lq ~ 9.93. However, this does not represent the bolometric 
luminosity since the spectral energy distribution XF\ is approximately flat in the range 0.3-1000/zm, and 
perhaps to even shorter wavelengths. Taking this into account we estimate the bolometric luminosity to be 
logLboi/i© = 10.2. 



2. Observations and Data Reduction 

The data presented here were obtained on 21 Dec 2004 using SINFONI (Eisenhauer et al. 2003a; Bonnet 
et al. 2004) on the VLT UT4. The instrument consists of a cryogenic near infrared integral field spectrometer 
SPIFFI (Eisenhauer et al. 2003a,b) coupled to a visible curvature adaptive optics (AG) system (Bonnet et al. 
2003). The AG module was able to correct on the nucleus of NGG 3227 (for which it measured R=13.9 mag) 
in seeing of ~0.6", to reach nearly the diffraction limit of the telescope in the K-band (an estimated 15% 
Strehl). With the appropriate pixel scale selected, the spectrograph was able, in a single shot, to obtain 
spectra covering the whole of the K-band (approximately 1.95-2.45/xm) at a spectral resolution of i? ^ 4300 
for each 0.0125" x 0.025" pixel in a 0.80" x 0.80" field of view. A total of 3 sky and 6 on-source exposures 
of 600 sec each, dithered by up to 0.2", were combined to make the final data cube. 

The data were reduced using the SINFONI custom reduction package SPRED. This performs all the 
usual steps needed to reduce near infrared spectra, but with additional routines for reconstructing the data 
cube. Because the sky airglow did not vary much during the observations and is low compared to the read 
noise at such small pixel scales, it was possible to make a combined sky frame which could be subtracted from 
all the on-source frames without leaving OH line residuals. Following this step, the frames were flatfielded 
and corrected for dead/hot pixels. The data were then interpolated to linear wavelength and spatial scales, 
after which the slitlets were aligned and stacked up to create a cube. Finally the atmospheric absorption was 
compensated using the B9 star HD 83434; and flux calibration was performed using HD 83434 (K=6.917) 
and HD 87015 (K=6.144), which both yielded a zeropoint of 16.08 mag. 



Spatial Resolution No additional point-spread function (PSF) calibration frames using stars were taken. 
This is primarily because, although in principle one can match the brightness of a calibration star on the 
wavefront sensor to the AGN, it is not possible to replicate either the spatial extent of the AGN or the 
background galaxy light associated with it. Together with changes in the ambient seeing which can occur 
on relatively fast timescales, this can result in a potentially considerable mismatch between the science and 
calibration PSFs (Davies et al. 2004c). Since, for the analysis presented here, a highly accurate PSF is not 
needed, we have instead made use of the broad Br7 emission which is unresolved: it has been measured to be 
of order 0.02 pc by Salamnca et al. (1994) and Onken et al. (2003). This has the advantage of providing the 
resolution directly from the science frames, and includes all effects associated with shifting and combining 
the cube, and also smoothing (a 0.0375" x 0.0375" median filter was applied to the final cube). The PSF 
we use for convolving our models is then approximated by a symmetrical fit to the broad Br7 emission as 
shown in Fig. 1. In this figure we have plotted the value for every pixel within 0.3" of the centre, from 
which it can be seen there are no asymmetric artifacts resulting from the AO correction. Overdrawn are 
both Gaussian and Moffat profiles. The latter, as expected, matches the faint extended wings in the profile; 
but both reproduce the core of the profile equally well. They yield a FWHM resolution of 0.085". 

Derivation of the Kinematics and their Uncertainties The velocity and dispersion of the emission 
lines is found by simultaneously fitting the continuum level with a linear function and the line itself with a 
Gaussian. The uncertainty of the fit is estimated using Monte Carlo techniques assuming that the noise is 
uncorrelated and the line is well represented by a Gaussian. The method involves adding a Gaussian with 
the measured properties to a spectral segment with the same noise statistics as the data, and re-fitting the 
line to yield a new set of Gaussian parameters. After repeating this 100 times, the standard deviation of the 
center and dispersion are used as the uncertainties for the velocity and line width. 

The shape of the CO absorption bandheads in a galaxy spectrum arises from the intrinsic spectral 
profile of the stars themselves convolved with a broadening profile which carries the information about the 
kinematics. The use of template stars (we chose the KO I star HD 179323) allows one to separate these two 
components. One technique commonly used is deconvolution with the Fourier cross-correlation quotient 
method. The principal advantage of this method is that it allows one to recover the full broadening profile 
even if it is not well represented by a function such as a Gaussian (plus Hermite terms). However, because 
the Fourier transform of the object is divided by that of the template, the noise is amplified; and while it 
can be reduced by Wiener filtering, setting the optimal parameters of the filter is only possible if the noise 
and signal can be properly distinguished in Fourier space. If this is not done correctly, it can be difficult to 
obtain the correct object dispersion. The alternative is to convolve the template with an analytical function 
and iteratively minimise its difference to the object using a y^ criterion. The limitation of this technique 
is its speed and the fact that the convolution function needs to be relatively simple. On the other hand, it 
is less adversely affected by noisy data and one can easily reject bad data values. In addition it provides a 
clear path to estimating the uncertainties using standard techniques. One can estimate the confidence level 
for each parameter separately by finding the value for which, with all the other parameters re-optimised, ^ 
increases by 1. 

It is the latter method which we have used because the individual spectra for each spatial pixel are 
rather noisy, and we only extract the usual three parameters for a Gaussian fit. Tests have shown that for 
these data it is not possible to determine coefficients for the Hermite polynomials which would indicate the 
deviations from a Gaussian. 

We have only made use of the first (i.e. CO 2-0) bandhead, because [Caviii] emission at the edge of 



the CO 3-1 strongly biasses the derived velocities (making them much more positive where the line emission 
is strongest) when this second bandhead is used. The third CO 4-2 bandhead adds very little because it is 
strongly affected by residual atmospheric features. 

Smoothing All the images presented in this paper have been smoothed using optimal Voronoi tessellations, 
as described and implemented by Cappellari & Copin (2003). This scheme uses adaptive binning to group 
pixels so that the combined signal to noise of each group is above a minimum threshold. Pixels which are 
already above the threshold are not binned. Thus the final resolution varies across the image, although at 
no point is it improved beyond the original resolution in the data. 

The algorithm bins pixels together into groups by accreting new pixels to each group according to how 
close they are to the centroid of the current group. Checks ensure that each time a pixel is added to a 
group, both the group remains 'round' and the signal to noise increases. The resulting groups then provide 
a set of positions (centroids) and mean fluxes which are used as the initial 'generators'. A further algorithm 
optimises the generator configuration based on a centroidal Voronoi tessellation. The final set of generators 
are the positions of the flux-weighted centroids of each binned group, and have the property that each pixel 
in the original image is assigned to the group corresponding to the nearest generator. Thus the positions of 
the generators provide all the information necessary to re-create the binned image. 

Since it can be hard for the human eye to comprehend the binned images created in this way, we have 
- for the purposes of presentation only - applied a final step of interpolating all the pixel values from the 
generators of the Voronoi tessellation using a minimum curvature spline surface (as implemented in IDL 
version 6.0). In the resulting smooth images, pixels with high signal to noise (i.e. unbinned pixels) have very 
nearly the same value as the original image. 



3. The Circumnuclear Stellar Ring 

When the pixel scale of SINFONI is matched to the diffraction limit of the VLT, its field of view is only 
~1". An important step is therefore to understand the larger scale context in which these data reside. To 
facilitate this we have used H-band data - which trace the stellar light with less bias to recent star formation 
than optical data - from both the 2MASS large galaxy atlas (Jarrett et al. 2003) on scales of 2-100", and 
the Hubble Space Telescope [HST] archive on scales of 0.2-10" (proposal 7172, Rieke). Before analysing the 
HST F160W image, it was deconvolved using a PSF generated by Tiny Tim 6.2, rotated to the standard 
orientation and trimmed to 13.5" on a side. Isophotal fitting (using the ellipse task in iraf) was performed 
on both these images and the resulting profiles scaled according to their overlapping region. Additionally, 
a bulge-plus-disk model (the parameters of the disk being fixed at the values determined from the large 
scale 2MASS data) was fit to the F160W image, excluding radii smaller than 1.6", using a y^ minimisation. 
This model was then divided into the F160W image in order to investigate the excess continuum emission 
apparent at radii of 1-1.5". 

On the largest scales there is an almost perfect exponential profile with a disk scale length of 26.8" 
(2.1 kpc) and an axis ratio of 0.4-0.6 (corresponding to an inclination of 55-65°) at a position angle (PA) of 
^ —30°. At smaller scales, there is a smooth transition to an r^/^ profile with an effective radius of rgs = 3.4" 
(270 pc), which dominates the emission at radii 2-9". This r^ii is rather larger than the 2.6" found by Nelson 
et al. (2004) from optical data, probably because we have less bias from nuclear star formation and the AGN. 



What was previously reported by Chapman et al. (2000) as a possible knotty one-armed spiral is now 
revealed in Fig. 2 as a ring of excess continuum with radius 1.7" (140 pc). Its axis ratio of 0.6 and PA of 
—30°) are consistent with both the large scale disk and also the circumnuclear molecular gas ring reported 
by both Schinnerer et al. (2000) and Baker (2000). The kinematics of the CO (2-1) indicate that the gas lies 
in a disk; and the stellar ring indicates that the stars also lie in a disk, since resonances such as these are 
features of disk dynamics. The similarity of the loci traced in both cases suggests that the stellar and gas 
disks are in fact the same. The ring was hypothesised by Schinnerer et al. (2000) to be at the location of 
an inner Lindblad resonance (ILR) associated with a secondary inner bar on scales smaller than 20", which 
is itself the ILR for the primary bar. Evidence for such a secondary bar was presented by Baker (2000) 
who noted elongated molecular structures along the major axis at radii of 10-15", which he suggests may 
be concentrations of gas at the leading edges of a bar. However, he cautions that the velocities are different 
to those expected for gas inflow along the bar. 

The 1" field of view of SINFONI lies entirely inside the 140 pc ring, as indicated on Fig. 2. Its view is 
presented in Fig. 3, where we show images of the 2.1^m continuum, the stellar continuum measured from 
the CO 2-0 absorption bandhead, the 2.12/xm H2 1-0 S(l) line, the 2.17/L(m Br7 line, and the coronal [Caviii] 
line at 2.32/im. Fluxes and flux densities are summarised in Table 1. The corresponding kinematics for the 
1-0 S(l) and Br7 lines and the stellar CO 2-0 absorption arc shown in Fig. 4. In the Icfthand panels we show 
the full field that can be measured. In the centre panels we have applied a mask to show only velocities 
associated with the brightest pixels: those that contribute 2/3 of the total flux, and hence the bulk of the 
emitting medium, in the field. Velocity dispersions are shown in the righthand panels. 



4. Nuclear Star Formation 

There is some evidence in the literature that points to a scenario involving recent vigorous star formation 
in the nucleus of NGC 3227. Based on an analysis of seeing limited near infrared integral field spectroscopy, 
Schinnerer et al. (2001) argued that there was a 25-50 Myr cluster present, although they could not rule out 
a much older population. In addition, there is the detection of the 3.3/im polycyclic aromatic hydrocarbon 
(PAH) feature in a 0.8"xl.7" slit aperture by Rodriguez-Ardila & Viegas (2003). They reported that the 
ratio is.a/LiR ^ 7 x 10^^ is lower than the mean for starbursts but typical of Seyfert 2s. PAHs have also 
been detected in AGN where star formation is also active, perhaps most notably Mkn231 (Rigopoulou et al. 
1999) and NGC 7469 (Mazzarefla et al. 1994); Davies et al. (2004a,b) have shown beyond doubt that these 
galaxies host massive nuclear star formation. In this Section we address the questions of the presence and 
age of a distinct stellar component in the nucleus. We consider evidence from the stellar absorption features 
(Sections 4.1 and 4.2), the narrow Br7 (Section 4.3), and radio continuum (Section 4.4). Finally we apply 
population synthesis models in Section 4.5. 



4.1. Late Type Stars and Stellar Luminosity 

The CO bandheads at A > 2.29/im are often used as tracers of star formation in the K-band since, once 
they first appear, late type stars dominate the near infrared stellar continuum. Under the assumption that 
either the equivalent width of the CO 2-0 bandhead, M^co, is independent of stellar population or that the 
stellar population does not change across the region of interest, the absolute absorption in the bandhead 
provides a direct tracer of stellar luminosity. However, given that Forster Schreiber (2000) and others have 



shown that Wqo in individual stars can vary from 0-20 A depending on stellar type (effective temperature), 
the validity of this assumption is far from clear. Fortunately, there is observational and theoretical evidence 
that it is valid. Oliva et al. (1995) found very little variation in Wco between elliptical, spiral, and Hii 
galaxies. In addition, star cluster models indicate that for ensembles of stars, Wqo is much more stable, 
reaching a value close to 12A once the age of the cluster has exceeded lOMyr. This is demonstrated in 
Fig. 5 which shows how Wqo varies as a function of age for various star formation histories. The data 
were generated using the population synthesis code STARS (Sternberg 1998; Thornley et al. 2000; Davies 
et al. 2003; Sternberg et al. 2003; Davies et al. 2005), which calculates the distributions of stars in the 
Hertzsprung-Russell diagram as a function of age for exponentially decaying star formation rates. Using 
empirically determined Wco from library spectra (Forster Schreiber 2000), the code then computes the 
time-dependent Wco for the entire cluster of stars. The code includes the thermally pulsing asymptotic 
giant branch (TP-AGB) stars which have a very significant impact on the depth of the absorption features 
at ages of 0.4-2 Gyr (Forster Schreiber et al. 2003; Maraston 2005). Throughout this work, we adopt a solar 
metallicity Salpeter IMF in the range 1-100 M0. 

Fig. 5 shows that once CO absorption is present, Wco does not deviate from 12A by more than 20% 
except in the special case of instantaneous star formation with an age of less than 15 Myr. This insenitivity 
of Wco to star formation history means that, although it cannot differentiate between young and old stellar 
populations, it can be used to distinguish the stellar and non-stellar continuum and to trace the total stellar 
luminosity profile. The observed Wco = 3.6 A (measured in the bandpass prescribed by Forster Schreiber 
2000) implies a mean dilution (i.e. ratio of total continuum to stellar continuum) in a 0.8" aperture of 3.3 at 
2.3/^m. The average dilution over the whole K-band will be less - as small as 2.3 if the hottest dust associated 
with the AGN is at a typical temperature of 500 K within this aperture (or hotter but also reddened) . We 
have adopted an intermediate value of 2.8 corresponding to a characteristic temperature of 1000 K. Within 
the aperture, we find a K-band magnitude of 10.28 (a little brighter than that given by Schinncrcr et al. 2001, 
perhaps due to their lower resolution or because of the wings often associated with the PSF of shift-and-add 
data), which implies a stellar magnitude of 11.4 or equivalently a 1.9-2.5/xm luminosity of logLx/i© = 7.8. 

This luminosity includes both a possibly young population as well as the old bulge population. For- 
tunately, the respective contributions from these two components can be disentangled, as the top panel 
of Fig. 6 shows. This figure includes data at different scales from various sources, all of which have pros 
and cons: 2MASS H-band covers the largest scales but cannot probe scales less than 2-3" due to limited 
resolution; HST F160W image covers intermediate and small scales at high resolution, but cannot probe 
radii smaller than ~0.2" due to the bright point source associated with the AGN; SINFONI total K-band 
continuum covers the central arcsec at high spatial resolution, but is limited by the AGN at radii less than 
^^0.05"; SINFONI stellar K-band continuum probes the very smallest scales without being affected by the 
AGN, but is noisier (see Fig. 3). The large scale disk and bulge models discussed in Section 3 reveal an excess 
of emission at radii less than 0.5" that increases to become very significant closer than 0.2" from the centre. 
Presumably this excess is associated with the recent star formation that is also the origin of the narrow Br7 
fiux. It accounts for 60% of the continuum in a 0.8" aperture. Hence the luminosity of the young stellar 
population is logiK/-^© — 7.6. 

When compared to the luminosities of star clusters predicted by STARS (e.g. see Davies et al. 2003) 
or to individual clusters in other nearby galaxies, this suggests a very significant (probably young) stellar 
component within 30 pc of the AGN. 



4.2. Nuclear Stellar Luminosity Profile 

The 2.1/iiii continuum (similar to a broadband K image) is dominated by tlie non-stellar continuum 
associated with the AGN and is barely resolved, with a FWHM of 0.10" (cf the resolution of 0.085"). On 
the other hand, a simple size measurement of the stellar continuum yields a FWHM of 0.17" indicating that 
the nuclear stellar component is easily resolved, having an intrinsic size scale of ~12pc. 

The axis ratio of 0.8 and PA —10° evident in the outer isophotes of the total stellar distribution - also 
apparent in the faintest levels of the 2.1/um image - differ only slightly from those of the 140 pc scale ring. 
This can be seen in Fig. 6: from radii of 10" where the bulge begins to dominate down to a scale of 2", 
the cUipticity of the isophotes decreases; but, as measured in the HST F160W data as well as the SINFONI 
continuum and stellar bandhead data, it increases again briefly at both the 1.7" radius of the ring and also 
at radii 0.2-0.5". The stellar kinematics (Fig. 4) show ordered rotation, albeit with a very large velocity 
dispersion, at PA -30° to -45°. 

It may be, as appears to be the case in NGC 7469 (Davies et al. 2004a), that the nuclear stellar cluster is 
triaxial, leading to an offset between the major (isophotal) axis and the kinematic axis. A triaxial potential 
could conceivably arise in a situation where a nested bar - perhaps associated with the 140 pc ring is 
dynamically heated and forms a mini pseudobulge (Kormendy & Kennicutt 2004). The timescale for such 
heating is of the order of 10^ yr for kpc-scale bulges and presumably shorter for nested bars where the pattern 
speed is faster and significant energy may be injected either by star formation or the AGN. If the heating is 
due to buckling instability, then it will result in the bar becoming weaker and more centrally compact (Raha 
et al. 1991), at least qualitatively consistent with what we are seeing here. 

However, in interpreting the stellar data one needs to bear in mind that it comprises similar contributions 
from the bulge and the nuclear component, the observed properties being the combination of these two 
components. As discussed above, the top panel of Fig. 6 disentangles these; and the excess emission revealed 
is drawn in more detail in Fig. 7. At radii ^ 0.5" the data show a clear break at about 0.15" characterised 
by a change in the radial luminosity gradient that is independent of the underlying bulge light (extrapolated 
from radii 2-10" and marked by a dashed line). The profile of the excess can be fit by an r^''* profile 
whose effective radius rcfr changes at 0.10-0.15". However, the two radii are r^i = 0.25" and re2 = 8.2", 
which are rather larger than the size scales over which the respective regions extend, leading to some doubt 
about whether they are physically meaningful fits. On the other hand, under the assumption that the excess 
continuum arises in a disk, it can be fit equally well by an exponential whose scale length r^ (note that disk 
scale length is related to effective radius by r^ = roff/1.68) changes at a radius of 0.11". At smaller radii 
Tdi ~ 0.037", while at larger radii rd2 — 0.38". 

In summary, the radial profile and rotational signature suggest a disk-like distribution for the nuclear 
component. On the other hand, the high dispersion and isophotal position angle (although both are biassed 
by the bulge component) indicate a thicker more spheroidal or even triaxial geometry. Taken together, these 
results suggest a thickened disk is the appropriate interpretation for the nuclear stellar light. 



4.3. Ionised Gas and Young Stars 

The Br7 map in the right hand panel of Fig. 3 shows only the narrow (FWHM 200-300 km s^^) com- 
ponent of the line, achieved by fitting out the broad (^3000 km s^^) component. The resulting emission is 
resolved along all PAs. The crucial question here is whether the bulk of the Br7 originates from an out- 



flow in the narrow line region (NLR) or from OB stars. On scales of 1-7" the [O in] emission is extended 
on PAs of +15° to +30° (MundeU et al. 1995; Schmitt & Kinney 1996), roughly along the galaxy's minor 
axis. However, Schmitt & Kinney (1996) also note a knot of emission ~ 0.25" from the nucleus that they 
suggested could be associated with the double-peaked radio continuum source in MundcU ct al. (1995). In- 
deed, approximately 0.2" north and south of the nucleus, the dispersion of the Br7 line is much larger than 
elsewhere, which could be indicative that these features are related and originate in the NLR. On the other 
hand, both the morphology and primary velocity gradient of the Br7 are oriented to the north-west - which 
is strong evidence for a direct relation to the stars and molecular gas. A quantitative discrimination can 
be made by assigning emission with a dispersion greater than 150 kms to the NLR, and the rest (which lies 
along the major axis) to star formation. We find that ~ 75% of the Br7 fiux most probably originates in 
star formation. Section 4.5 and Fig. 8 address the consequences on our star burst models of the uncertainty 
of this conclusion. 



4.4. Radio Continuum and Supernova Remnants 

A map of the 6 cm radio continuum at a resolution of 0.05" x 0.07", comparable to that we have achieved, 
was published by MundeU et al. (1995). This showed a 0.3" long structure, composed of knots of emission, 
elongated at PA —10°. In the 0.14"x0.17" resolution 18cm map these features were visible only as a 
bright compact source and a tail to the north; but an additional bright knot was also apparent 0.4" to the 
north. MundeU et al. (1995) suggested that these features might be radio jets - either one or, if the nucleus 
actually sits between the brightest knots, two sided. However, they noted a number of difficulties with this 
interpretation: lateral extensions were detected well above the noise level; and the jet orientation differs 
from that of the narrow line region, as traced by [Oiii]. They suggested that as in NGC4151 (e.g. Pedler 
et al. 1993), misalignment may be expected between the radio axis, which lies along the coUimation axis of 
the UV ionisation cone, and a density bounded extended narrow line region (ENLR). However, they also 
realised that the line widths of the [Oiii] are indicative of a NLR rather than an ENLR, and in addition for 
such a model to apply, the NE side of the disk would have to be closer and hence the spiral arms would have 
to be leading rather than trailing. To circumvent these difficulties, they suggested that the radio coUimation 
and [Oiii] outflow may not be due to the same mechanism. 

These discrepancies cease to be issues - indeed, actually are expected - if instead one attributes the 
radio continuum to supernova remnants (SNRs). In this scenario, the knotty morphology could arise from 
the superposition of many SNRs, some of which are perhaps bright enough to be detected as individual 
radio supcrnovae. Supporting this, the bright northern knot apparent in the 18 cm map is a result of the 
low resolution of that map: Fig. 9 shows that the same knot seen in a low resolution 6 cm map breaks up 
into discrete structures at higher resolution. For such an interpretion, the details of the morphology have no 
more signiflcance than tracing the locations of these stochastic events. 

The flux density of the individual 6 cm knots in NGC3227 is variable, with the brightest being 1-2 mJy 
(excluding the extended nuclear knot which probably comprises many SNR). This implies luminosities up to 
3-6 X 10^® WHz^^, fully consistent with the range of 1 1000 x lO^^WHz^^ for the peak 6cm flux density of 
galactic radio supernovae given by Weiler et al. (2002). For comparison, in Arp 220, 16 radio supernovae with 
18 cm luminosities 20-60 x 10^^ WHz~^ (a factor ~3 lower at 6 cm, assuming a spectral index a = —1) were 
reported by Smith et al. (1998); and for the 24 sources monitored in the nucleus of M 82, 6 cm luminosities are 
in the range 0.1-10 x 10^^ WHz^^ (Kronberg ct al. 2000). In both these galaxies, most of the SNRs appear 
to have varied rather little in flux over respectively 5 and 12 year periods (Rovilos et al. 2005; Kronberg et 
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al. 2000). The flux densities of the knots in NGC3227 are therefore certainly consistent with their being 
radio supernovae. 

We have superimposed the radio continuum data (naturaUy weighted, 0.076" x 0.053" beam size, kindly 
made available and re- reduced by C. Mundell) and SINFONI data, as shown in Fig. 9. Since no astrometrical 
alignment is possible, we have assumed that the brightest 6 cm emission coincides with the peak in the K- 
band stellar continuum - justifiable if, as we suspect, the 6 cm emission is due to star formation. For this 
interpretation, the exact astrometrical alignment is not critical. An alternative alignment could be for the 
near infrared nucleus to lie between the two main emission knots seen in the 18 cm map of Mundell et 
al. (1995). However, this seems unlikely because, as we have shown, the jet-like structure is in fact small 
spread-out knots seen at low resolution. There is perhaps some correspondance between these off-nuclear 
6 cm continuum knots, which tend to lie to the northwest, and the Br7 line which is also more extended in 
the same direction. Both of these can be understood in terms of a slight tendency for more, or more recent, 
star formation there. On the other hand, it is clear that there is little one-to-one correspondance between 
the 6cm continuum and 1-0 S(l) line. 

To estimate the emission within 0.8", we take two extreme limits: that for the nucleus (i.e. southern 
component) only, as well as the total, both given by Mundell et al. (1995). We estimate the maximum AGN 
contribution by measuring the flux in the central 0.090" (to include the full beam) as no more than 10% of 
the total. We have made no correction for this. Using the conversion given in Condon (1992) for these 6 cm 
and 18cm flux densities, we estimate the supernova rate to lie within the range 0. 008-0. 019yr^^. 



4.5. Star Cluster Models 

The discussion above has yielded two independent diagnostics which can be used to constrain the star 
formation history in the central arcsec of NGC3227. The first is the (classical) equivalent width of Br7. 
We use the ratio of the narrow Br7 associated with young stars (i.e. excluding the NLR contribution) to 
young stellar continuum, thus removing any bias associated with the AGN or bulge. The ratio we adopt is 
therefore M^Br7 = 5 A, with an uncertainty of no more than 1.5 A. The second is the ratio of the supernova 
rate to stellar continuum luminosity, which is 10^" t^sN(y^^^) / ^-iyX^q) — 2.2-5.3. These ratios are presented 
graphically together with models for various star formation histories in Fig. 8, allowing some conclusions to 
be drawn immediately. 

The WBr7 is too low for the star formation to be continuous, a result supported by the j^sn/^k which is 
sufficiently large that it permits only young ages. This conclusion is robust even to large uncertainties in the 
two parameters. In particular Fig. 8 shows that the conclusion is valid even if either WBr7 is overestimated by 
insufficient corretion for the NLR contribution, or if i^sn/-^k is overestimated by not correcting for a possible 
AGN component. Thus any star formation that did occur has now ceased. On the other hand, because 
there are already supernova remnants, at least lOMyr must have elapsed since star formation began. Thus 
an instantaneous burst is ruled out, because of the steepness with which T4^Br7 falls. For the intermediate 
histories, the age of the star formation is strongly constrained by i^sn/^K) even if the ratio itself is rather 
uncertain, to be at most 50 Myr. Even if one allows the uncertainty in the conversion from continuum flux 
density to supernova rate, it would be hard to reach ages greater than 100 Myr. With this in mind, the 
low WBr7 requires a short but finite burst timescale. The best model is for a burst with an e-folding decay 
time of 10 Myr which began 40 Myr ago. However, we emphasise that a unique match is not the aim of this 
modelling; the important result is that the star formation is very young, and that the active episode lasted 
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for a finite time but is now finished. This is in interesting result, begging the questions: since stars form in 
cold quiescent environments, how did such intense star formation occur in this environment which is clearly 
extremely turbulent (as evidenced by the 1-0 S(l) morphology in Fig. 3 and the velocity dispersion of 100- 
125 kms^^, which is significantly higher than the rotational velocity of 50-100 kms^^)? Once started, what 
is it that caused the star formation to cease while the gas is still so plentiful? We address these questions in 
Section 6. 

The properties we find for the star formation scenario given above for the measurements in a 0.8" 
aperture are summarised in Table 2. The star formation has an initial rate of 3M0yr~^, and a current 
rate of 0.05 Moyr^^. The current mass to K-band light ratio is M/Lk ~ 0.5Mq/Lq, where the mass of 
2.0 X 10^ M0 refers to the current live stars rather than the total gas consumed during the active phase 
which is almost 50% greater. Remarkably, the current bolometric luminosity attributable to these stars is 
logLboi/-^© = 9.5. This is a crucial result, since it implies that the extinction to the stars cannot be very 
great. 

Fig 7 shows that within the central 0.8" we discuss above, the intensity increases dramatically at 
r < 0.11" (9pc). Within this smaller radius, the star formation models show that the mean stellar mass 
surface density is 3 x IO^'Mqpc"^. Averaged over the last 40Myr (i.e. since the burst began) the mean 
absolute star formation rate is only O.13M0yr~^, but the rate per unit area has been phenomenally high: 
typically 500 Mq yr^^ kpc^^, and reaching rates 10 times higher intensity at its peak when it was active This 
is more like what one typically expects of ultraluminous galaxies, and suggests that during active star forming 
phases, which appear to last for similar timescales of a few 10^ yr, the local (i.e. <10-100pc) environment 
around AGN is comparable to that in ULIRGs. 



4.6. Extinction 

So far we have implicitly assumed that there is negligible extinction, and already we account for 20% 
of the bolometric (i.e. 0.3-1000/im) luminosity of the entire galaxy. 

If we take the reasonable stance that the gas and stars are mixed (rather than the gas lying in front of the 
stars as a foreground screen) , then when considering the effects of extinction we should use the appropriate 
mixed model for which the reduction in observed intensity depends on optical depth through the gas and 
stars as I/Iq = (1 — e^'^'"''')/Tjnix- The usual scaling of the optical depth with wavelength tk = O.lry 
still applies, as does the definition of the reddening A\ = —2.5logI\/I\o. This means that in contrast to 
the screen model for which A\ — 1.09ta, in the mixed model a modest observable reddening can hide the 
existence of a very large optical depth of gas and dust since Ax = 2.5 log ta (for t\ > 3). Note, however, 
that often extinctions for mixed models are given as a sort of 'screen equivalent', giving rise to very large 
values. To avoid this confusion when discussing flux attenutation, in the remainder of this paper we refer to 
the more physical quantity optical depth t\ rather than the observationally motivated extinction A\. 

As an illustration, a reasonable optical depth of Ty = 10 for the mixed model would mean that 90% 
of the UV and optical light, which dominate the spectral energy distribution, would be re-radiated in the 
mid- to far- infrared. Additionally, the scale of the starburst would increase since Lk becomes 1.6 times that 
observed. Thus the net effect is that the bolometric luminosity of the starburst would be L^oi/Lq = 9.7 
and account for approximately 50% of the infrared (8-1000/zm) luminosity (although none of the optical 
luminosity). Similarly, one can derive the maximum possible mixed model optical depth, for which the 
starburst accounts for all the 8-1000/im luminosity, to be Ty = 26. This is a strong constraint on the 
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extinction, and wc return to this point in Section 6.3. 

Do we see evidence for any extinction? The F160W image in Fig. 2 highhghts a curious feature in the 
central arcsec. In addition to the bright point source associated with the AGN, there is a strong contrast 
in brightness between the SW and NE sides of the nucleus - corresponding respectively to near and far 
sides of the galactic disk for the orientation above. This is reflected in the corresponding spectra extracted 
from the SINFONI data cube, as Fig. 10 shows. The nuclear spectrum is characterised by a flat slope and 
shallow CO bandheads indicative of dilution by hot dust and perhaps also extinction. The spectrum to the 
south-west has a bluer slope and deeper bandheads, both consistent with pure un-reddened stellar light. In 
contrast, the north-eastern spectrum has deep bandheads but a flatter slope, suggesting it is reddened (but 
not diluted). A differential extinction (screen model) of Ak = 1-2 would produce the observed change in 
spectral slope. On the other hand, for the mixed model, any (largish) extinction is possible. For a mixed 
model, the differential extinction between 2.1 and 2.3/im saturates at this level, and cannot make the slope 
redder. One might conclude that while the star formation on the south west side might be predominantly 
unobscured, that on the north east side is probably mixed with considerable dust and gas. Conversely on 
larger scales of 1-2", Chapman et al. (2000) found evidence from optical and infrared colour maps for greater 
extinction on the south west side. 



5. Stellar Kinematics and the Black Hole Mass 

None of the velocity fields in Fig. 4 show evidence for the extreme warp proposed by Schinnerer et al. 
(2000) based on their 0.6" resolution CO (2-1) data. The existence of a small warp would not be surprising 
since very high spatial resolution mapping of masers indicate that on small spatial scales warps may be 
common in galaxy nuclei ~ for example NGC 4258 (Herrnstein et al. 1996), NGC 1068 (Greenhill et al. 1996), 
and Circinus (Greenhill et al. 2003). The reason for the apparent discrepancy is that Schinnerer et al. 2000 
had interpreted the high gas dispersion as spatially unresolved rotation or inflow - whereas our data show 
that the gas dispersion CTgas ^ 120kms~^ is intrinsically high. This is not very different from the stellar 
dispersion (t* of 140-160 km s^^, which indicates that random motions rather than ordered rotation dominate 
the kinematics of the stars. As indicated above, the observed data are the superposition of kinematics of 
the bulge and nuclear component, and this may be responsible for some of the unusual characteristics of the 
dispersion most easily seen in Fig. 11. This shows that cr* is constant at larger radii (out to our radial limit) 
but decreases at 0.1-0.2". This is exactly the radius at which Fig. 7 shows an increase in the intensity of 
the stellar light from the nuclear component. It is therefore possible that at larger radii the observable cr* 
is dominated by the older bulge, but at smaller radii a,: is more strongly affected by the stars in the nuclear 
region. If this is indeed the case, then the subsequent increase of a,: very close to the nucleus could be due 
to the influence of the black hole. 

To test whether the behaviour of cr* is consistent with what one might expect from the nuclear star 
cluster and black hole, we have made a simple dynamical model. This is realised as an edge-on thin disk 
for which the mass profile follows the luminosity profile given in Fig. 7, and accounts for both the gas and 
stellar mass. For this purpose, we have ignored the luminosity from the large scale bulge (denoted by the 
dashed line in the figure). To simulate the random motions which we have shown are important, we have 
convolved the model spectrally with a Gaussian having a = 120kms^^. The model is then convolved with 
the effective spatial resolution and finally 'observed' (see Davies et al. 2004a, b for more details of how the 
model is generated). Although this model does not represent the true 3-dimensional mass distribution, the 
radial distribution is correct and hence it is able to provide a first estimate of the way in which the velocity 
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dispersion changes with radius. In addition it can, at least quantitatively, show how the rotation velocity is 
expected to change with radius. The first conclusion is that the increase in cr, at r < 0.1" is indeed consistent 
with the nuclear mass distribution and presence of a black hole with mass of order 2 x 10^ Mq. The second 
conclusion confirms that, as expected, the system cannot be rotationally supported: the measured velocities 
in Fig. 11 are only 60-65% of those needed for this. Nevertheless, the steep increase in Vrot out to r = 0.1" 
and subsequent more gradual rise do match the shape of the model rotation curve. 

In order to understand the kinematic behaviour more fully and to determine A'/bh from the stellar 
kinematics, we have constructed an appropriate Schwarzschild orbit superposition model, which is described 
and discussed below. NGC 3227 is a good candidate for such an analysis because it has already been the 
focus of other efforts to derive Mbh- These include reverberation mapping, which yielded 4.2 ± 2.1 x lO"^ M© 
(Peterson et al. 2004); X-ray variability measurements, giving 2.2 x 10^ M© (Nikolajuk et al. 2004); and the 
Mbh — (J* relation itself, leading to 3.6 ± 1.4 x 10^ M© (Nelson et al. 2004). Based on these masses and the 
velocity dispersion of 136 kms^^ measured by (Nelson et al. 2004), one can estimate the 'radius of influence' 
of the black hole to be Vg ~ GMbh/c^ ~8pc. Hence the black hole dominates the dynamics in the central 
16 pc, a region which is easily resolved at our 7pc FWHM spatial resolution. 



5.1. Schwarzschild ModeUing 

Schwarzschild's (1979) orbit superposition technique has become the standard tool for deriving Mbh 
from the kinematics of the surrounding stellar spheroid. The procedure commonly involves four steps: (1) 
the photometry is deprojected to get the luminosity distribution i' of the stars; (2) a gravitational potential 
is constructed from trial values for the stellar mass-to- light ratio T and the black hole mass Mbh; (3) 
thousands of orbits in this potential are combined to match the stellar luminosity distribution and the 
kinematical constraints; (4) T and Mbh are systematically varied to find the optimal solution in a x^ sense. 
The specific implementation of the method we use here is detailed in Thomas et al. (2004). The program is 
based on the code of the Nuker team (Richstone et al. in preparation) that has been used to measure black 
hole masses in numerous elliptical galaxies (Gebhardt et al. 2003), including CenA (Silge et al. 2005). We 
apply these models here with the aim of constraining the black hole mass rather than necessarily deriving 
uniquely the kinematic structure of the entire system. 

The validity of our models rests on two major assumptions: that the central region of NGC 3227 is (1) 
stationary and (2) axisymmetric. The first assumption is difficult to verify independently. From the quality 
of our fits to the data (cf. Fig. 16) we can however conclude that - within the uncertainties - this assumption 
is consistent with the data. To which degree the second assumption holds for NGC 3227 can be estimated 
from variations in the kinematics from quadrant to quadrant (cf. Sec. 5.1.2). Providing means to measure 
such variations is one of the advantages of using a full 2-dimensional field of view as we do here. The other 
advantage is that such data constrain the kinematics of all kinds of orbits that are needed to map the system. 

The present models for NGC 3227 are calculated on a grid with 7 radial and 4 angular bins in each 
quadrant (Fig. 12), giving a total which is comparable to the number of spatially independent regions in the 
data. The binning resolution is matched to sample more frequently the regions where the greatest variations 
in V and a are expected to lie. For each mass model about 2 x 3300 orbits are tracked (the initial factor 
2 due to the inclusion of a prograde and retrograde version of each orbit) which, based on the criterion in 
Richstone et al. (2004) is easily sufficient. 

We now discuss how the observations are prepared for the models. 
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5.1.1. Photometry 

As we have shown, the luminosity distribution comprises two parts: the bulge and the nuclear star 
forming region. 

The bulge, which has an r^/^ profile with an effective radius in the K-band of 3.4", contributes 40% 
of the nuclear K-band luminosity and outside of this immediate region dominates the K-band luminosity to 
a radius of 9". Due to its high mass-to-light ratio (M/Lfc), it has an important effect on the model. We 
are unable to constrain its M/Lk ratio directly, but from a literature search Forster Schreiber et al. (2003) 
found that empirical determinations lie in the range 10-30 Mq/Lq - similar to those predicted by population 
synthesis models for old (~ lOGyr) populations. 

The nuclear star forming region contributes 60% of the nuclear K-band luminosity and has a K-band 
luminosity profile that is well matched by an exponential profile with a change in scale length from 0.037" 
to 0.38" at 0.11". Its M/Lk ratio is well constrained via population synthesis models to be O.SMq/Lq. 
However, its mass could be dwarfed by that of the molecular gas which lies in the same region and which we 
assume has the same radial profile. The gas clearly contributes significantly to the gravitational potential, 
and implies that the effective M/Lk ratio should probably be rather higher. 

As a novel feature of our models, and due to the uncertainties in the M/Lk ratios, we allow the 
contributions of the two components to the total (stellar) mass-profile pstar to be varied independently. To 
this end the nucleus and the bulge are deprojected separately using the program of Magorrian (1999) and 
combined via 

Pstar = Tf"nuc t^nuc + Tf"bul f^bul- (1) 

Thereby, the (K-band) mass-to-light ratios Tnuc and Tbui of nucleus and bulge (j^nuc and I'bui are the 
corresponding deprojections) are assumed constant with radius. We set the ellipticity of nucleus and bulge 
Enuc = Cbui = 0.3. The inclination can be varied arbitrarily, but must be equal for both components (to 
ensure axisymmetry) . 



5.1.2. Kinematics 

The velocity and dispersion were found for each radial and angular bin shown in Fig. 12, by calculating 
the mean of all spatial pixels within each bin, weighted according to the uncertainty of each measurement. 
The uncertainties were combined in a similar fashion to estimate the standard error of these means. The 
position angle which defines the major axis, and hence the orientation of the bins, is relatively well con- 
strained. We have shown that on scales of 140 pc and more, there is no doubt that it is at —30°. On the 
scales we consider here, less than 50 pc, the data indicate a preference towards —45°. We have therefore 
adopted —40°, which is sufficiently close to both limits that, given the size of the angular bins, the small 
uncertainty will have no impact on the resulting model. As input to the orbit models binned line profiles are 
generated from the measured kinematical parameters (see, for example, Thomas et al. 2005). Uncertainties 
in these parameters are propagated on the basis of Monte Carlo simulations. 

The stellar dynamical determination of the black hole mass for NGC 3227 suffers from two difficulties: 
Firstly, due to the constraints of signal-to-noise, it was not possible to derive the Hermite terms /13 and /i4 
from the spectra in any meaningful way. These higher order moments, however, contain important informa- 
tion about the distribution of stellar orbits (Dehnen & Gerhard 1993). Unfortunately, due to the so-called 
"mass-anisotropy degeneracy" uncertainties in the orbit distribution directly translate into uncertainties in 
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the derived black-hole masses. Anyway, in order not to bias our models we allow for a rather large range of 
/13 and /14 by using h^ — h^ = ± 0.1 when deriving the line profiles and their uncertainties. 

The second limitation originates in the quadrant-to-quadrant variations of v and a, which are generally 
larger than the statistical errors. This indicates that the modelled region is not exactly axisymmetric. When 
averaging the kinematics from the four quadrants these variations are taken into account in the error bars. 

In summary, the assigned error bars to the kinematical input parameters arc rather large and mostly 
reflect (1) systematic uncertainties related to central deviations from axisymmetry and (2) our ignorance 
about the higher order Gauss-Hermite moments. 



5.2. The Black Hole Mass of NGC 3227 

The results of the modelling are shown in Figs. 13-16. The best fitting model has an inclination of 
i ~ 60° (fully consistent with that of the isophotes) and its black hole mass is Mbh = 1-5 x 10^ (Fig- 13). 
The corresponding stellar mass-to-light ratios are Tbui = 27.5 and Tnuc — 2.5, respectively. As Fig. 13 
shows, the difference between i = 60° and i = 75° is formally only at the one sigma level. In any case, the 
best-fit black-hole mass does not depend strongly on the assumed inclination. 

It turned out after the modeling that the uncertainties assigned to /13 and /14 probably overestimate the 
actual freedom in the orbit distribution. From all calculated models we found the 68 percentiles x^3/iVdata = 
0.01 and Xhi/^data = 0.06, respectively. Consequently, the variations in the fitted models are much smaller 
than the orignially allowed A/13 — Ahi = 0.1. To illustrate the effect this has on the derived confidence 
intervals we have rescaled these error bars to a third of their original value and recalculated A^^. The 
result for the case i =^ 60° is shown as the dashed line in Fig. 13. As apparent, it does not alter the best-fit 
black-hole mass significantly, but changes the confidence intervals drastically. 

In the discussion above, we have allowed the Schwarzschild model to find the best mass to light ratios 
as free parameters. On the other hand, as previously intimated, these are in fact already limited by external 
constraints based on other arguments, leaving Mbh as the only truly free parameter. In particular M / Lk 
for the bulge is unlikely to exceed 30Mq/Lq (Forster Schreiber et al. 2003); and the effective (stars plus 
gas) M/Lk for the nuclear component is likely to be in the range I-5M0/L0 (Sec. 6.3). It is for this reason 
that in Fig. 14, which shows the dependencies between the three dynamical parameters, we have calculated 
confidence intervals for each pair of parameters assuming that the third is fixed. The combination of the 
degeneracies and the external constraints puts stronger limits on the range of possible black hole masses 
than the Schwarzschild modelling alone. 

The figure shows a degeneracy between Tnuc and Mbh • Black hole masses of Mbh = 7x10^ — 2x10'' M© 
can be fit with Tnuc = 1 — 5Mq/Lq. The reason for the degeneracy is illustrated in Fig. 15: the nuclear 
component dominates the stellar mass density in the centre inside 0.1" and a larger Tnuc can therefore 
compensate a lower Mbh- Since its contribution to the central mass density is lower, the bulge Tbui is less 
dependent on the black-hole mass. But due to their similar light-profiles around 1.0", the bulge and nuclear 
mass-to-light ratios are coupled strongly to each other. From Fig. 14, we find that the M/Lk ratio for the 
bulge should lie in the range Tbui = 25 — 35Mq/Lq; and much higher ratios are unlikely. This is within 
the range mentioned above, and gives us confidence that the model is converging on a physically meaningful 
solution. 

In summary, we conclude that the range of Mbh we derive - which we stress again is rather large due to 



16 



the uncertainties discussed in Sec. 5.1.2 ~ is reasonably robust to the uncertainties in the input parameters. 
The limits of the range are set not only by the Schwarzschild models themselves but also by additional 
constraints on the mass to light ratios which, through degeneracies between the parameters, affect the black 
hole mass. The upper end is consistent with, although still less than, the other mass estimates mentioned at 
the start of the Section; while the lower end is an order of magnitude smaller, suggesting that these techniques 
may tend to overestimate the black hole mass. Nevertheless, that the preferred black hole mass we derive 
via Schwarzschild modelling is within a factor of 2-3 of the masses found by other means and suggests that 
all methods are reasonable at least to this level of accuracy. With respect to the Mbh — c* relation, it is 
interesting that in contrast to NGC 3227 where we have found above that the stellar dynamical mass is a 
factor of a few less than the relation predicts, the equivalent mass for Cen A (Silge et al. 2005) is a factor of 
several greater. This suggests that, while the Mbh ~ f* relation may be useful for order-of-magnitude Mbh 
estimates, the scatter for active galaxies may be significantly larger than that for quiescent galaxies. 



6. Molecular Gas and the Torus in NGC 3227 

The 1-0 S(l) morphology in Fig. 3 is remarkably complex. As has been noted by Schinnerer et al. (2000) 
and Baker (2000) on larger scales for the cold molecular gas, we now find on smaller scales for the hot 
molecular gas that the nucleus itself is not identified with the strongest emission. The distribution is rather 
elongated at a position angle of —45° (we note that there are inherent uncertainties and difficulties due to 
the continuum subtraction associated with the 100° reported by Quillen et al. (1999) and the 45-70° from 
Fernandez et al. (1999)). It is tempting to interpret the emission in terms of a bar and arclets. However, the 
kinematics and particularly the surprisingly high velocity dispersion of 100-125kms^^ argue against it; a 
more natural interpretation is in terms of a highly turbulent medium, in which nevertheless the bulk of the 
gas follows uniform rotation. There do appear to be some non-rotational motions, but associated only with 
weaker 1-0 S(l) emission and restricted to the galaxy's minor axis. This may indicate that some of the hot 
H2 is influenced by outflows, or that the emission from hot gas which we see here does not reflect well the 
distribution of the cold gas. Fig. 4 shows that the velocity fleld of the brighter emission - corresponding to 
2/3 of the flux in our field of view, and hence also the bulk of the gas mass - exhibits pure rotation. These 
facts leads one towards the conclusion that most of the gas in the nucleus probably exists in a thick rotating 
disk. 

Fig. 18 supports this interpretation. It shows the rotation velocity K-ot, corrected for inclination, and 
the dispersion a for the molecular gas. The data from the CO (2-1) data (Schinnerer et al. 2000) at larger 
scales and the SINFONI data on smaller scales are fully consistent. Also shown is the local Vrot/c ratio 
(rather than the global version which compares the maximum rotation velocity to the central dispersion). 
At r > 1" the ratio Vrot/c clearly implies that the gas lies in a thin disk. At smaller radii, the ratio decreases 
smoothly, indicating that the gas distribution becomes geometrically thicker with respect to the radial scales. 



6.1. Star Formation, Molecular Gas, and the Torus 

Above we argue that the gas lies in a thick disk, and in Section 4.2 that this is also the most likely 
distribution for the stars. Although the morphologies appear different - the stellar continuum is more 
centrally concentrated and the isophotal position angle is closer to north - much of this arises from the bulge 
contibution to the light. Crucially, the kinematics for the gas and stars are remarkably similar: the clear 
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velocity gradient at PA -30 to -45°, and the high dispersion of more than lOOkms"^. Because they exist on 
similar spatial scales of a few tens of parsecs and exhibit similar kinematics, we conclude that the stars and 
gas are physically mixed. 

Scales of a few tens of parsecs are exactly those on which theoretical models predict the obscuring torus 
should lie (Pier & Krolik 1992a, 1993; Granato & Danese 1994; Nenkova et al. 2002; Schartmann et al. 2005). 
In addition, an important property of the torus is its ability to collimate the UV radiation from the AGN 
to produce ionisation cones whose apex lies close to the AGN, i.e. it is geometrically thick. We have argued 
that our data show the gas and stellar distributions are thick. Our further conclusion is therefore that the 
gas and stars we see in the nucleus of NGC 3227 represent much of what is understood by the term 'obscuring 
torus'. 

The idea of a star forming torus is not new, and has already been modelled by Wada & Norman (2002) 
and Wada (2005) in response to the evidence that about half of Seyfert 2 nuclei have a nuclear starburst. 
However, from this work and that presented by Davies et al. (2004a, b), it is now becoming clear also for 
type 1 Seyfert nuclei that star formation on scales associated with the torus is an energetically important 
process. 

If the gas associated with the 1-0 S(l) emission were indeed part of the obscuring torus, it would need to 
have a third crucial property: a high column density. We address this issue in Section 6.3, but first consider 
below how the vertical height of the gas distribution might be maintained. 



6.2. Supporting the Vertical Thickness of the Torus 

That the torus in NGC 3227 is vertically extended is clear from Fig 18: the ^ 100 km s^-'^ velocity 
dispersion of the gas is larger than its inclination corrected rotational velocity out to radii of 30-40 pc. Even 
the lower CO (2-1) dispersion of ~ 60kms~^ is still significat compared to the rotation velocity to these 
radii. Thus the kinetic energy is dominated by random motions and implies a more spheroidal structure 
rather than simple ordered rotation in a thin disk. Its constituent molecular clouds will undergo cooling via 
collisions on short timescales, comparable to the orbital period of a few million years. As a result energy 
must be constantly injected into the interstellar medium in order to maintain the vertical structure. The 
sources available to supply this energy are the AGN and the starburst, the latter via both radiation pressure 
and mechanically through supernovae. Such a situation may also provide, at least qualitatively, an answer 
to why the star formation should have stopped while there is still a large gas reservoir. 

The effect of radiation pressure from the AGN itself has been explored by Pier & Krolik (1992b), 
and in their model depends primarily on the Eddington ratio of the black hole and the dumpiness of the 
torus. For the black hole mass, we adopt a value commensurate with our range foudn in Section 5.1, 
and those from X-ray variability (Nikolajuk et al. 2004) and reverberation mapping Onken et al. (2004), 
that is Mbh = 2 x IO'^Mq. For the AGN luminosity, we take half the bolometric luminosity since the 
other half is supplied by the nuclear starburst, that is Lbh = 10^°Lq. Since the Eddington luminosity 
IjEdd — 3.3 X 10'^Mbh in solar units, this yields an Eddington ratio of Len/LEdd ~ 0.015. This value is 
an order of magnitude smaller than the ratio of 0.1 that Pier & Krolik (1992b) found, was needed for a 
smooth torus to be thick and static. On the other hand, for tori consisting of large clumps, they found that 
in the range 0.01 < ^Edd ^0.1 an equilibrium would exist for thick tori with 1 > a/h ^0.1 where a is the 
inner radius and h is the full height in the body of the torus. In the case of NGC 3227 this could explain a 
thickness of up to a few parsecs but not the ten to a few tens of parsecs implied by the velocity dispersion. 
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unless the AGN was much more active in the recent past. 

The possibihty that supernovae might heat the molecular torus is treated specifically in the models of 
Wada & Norman (2002). However, because the coupling from supernova energy to kinetic energy of the 
ISM is rather weak, they used a very large supernova rate of :^sn '^ 1 yr^^- This is two orders of magnitude 
larger than the current rate in the central 60 pc of NGC3227, which is already close to the maximum for 
our best fitting star formation history. Hence, although they were able to reach scale heights for the torus 
of 10-20 pc, it seems unlikely that this could work for NGC3227. 

Recently, Thompson et al. (2005) have addressed the issue of whether radiation pressure from the 
starburst itself can provide the vertical support. For NGC3227 it is not clear that their optically thick 
model can be applied, since we have argued in Section 4.6 that the extinction must be low enough that even 
in the near infrared the optical depth cannot exceed T2.2 ~ 2.6 and is probably no more than T2.2 = 1- In 
their optically thin limit - where the optical depth to UV photons is >1 while to infrared photons from dust 
reprocessing it is <1 - they derive the star formation rate necessary to maintain marginal Toomrc stability 
[Q ^ 1), which for their model would give a scale height of '-^lOpc at a radius of 30 pc. If we assume a 
gas fraction of unity, and a ^ lOOkms^^ as measured for the 1-0 S(l) line we find a star formation rate 
^20MQyr^^ would be needed. This is two orders of magnitude greater than the current rate derived in 
Section 4.5 but only a factor of 6-7 more than the peak rate when the star formation was active. There are 
two ways to reconcile this difference. Using the 60kms^^ dispersion of the CO (2-1) data of Schinnerer et 
al. (2000), one finds the star formation rate necessary to maintain it is only 2.5M0yr~^. One could then 
argue that the higher dispersion of the 1-0 S(l) line is due to the fact that this line traces only the hot gas, 
and therefore will be strongly influenced by small scale turbulence. Alternatively, one could argue that the 
gas fraction /g is less than unity: for /g = 0.5, one derives a star formation rate of 5 Mq yr^^. 

Either of these alternative estimates are reasonably consistent with the peak rate derived from the 
observations. However, they are both still far greater than the current rate. It is perhaps conceivable that 
the molecular gas and star formation do not reach an equilibrium state. Instead, we speculate that what 
we are seeing is evidence for marginal Toomre stability: that once the gas cools, star formation may begin; 
but that once radiation pressure from the young stars is sufficiently high to heat the gas disk so that its 
turbulence increases and it thickens, star formation can no longer proceed. The molecular gas will then 
cool as the stars age, and sink back into a thinner configuration, allowing the process to start again. To 
investigate this in a more quantitative sense, we have calculated the Toomre Q parameter, which defined in 
the usual way as 
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where, using the data in Fig. 18, a is the velocity dispersion and k is the epicyclic frequency. The mass 
surface density S we have estimated as follows. For the CO (2-1) data it is derived from the velocity curve 
under the approximation of Keplcrian rotation in a thin disk. For our SINFONI 1-0 S(l) data, we have 
considered two cases: jjl is constant within the area considered; and S follows the stellar light distribution, 
being centrally concentrated. For both of these we have assumed a total mass out to r = 0.5" of 10® M© (see 
Section 6.3). The true distribution will lie somewhere between these two extremes. The resulting estimates 
of Q are shown in Fig. 19. The difference in Q derived from the 1-0 S(l) and CO (2-1) data is a direct result 
of the differing a apparent in Fig. 18 and to k. which depends on the details of the rotation curve. What the 
figure shows is that as one approaches the nucleus from large radii of 1-2", Q increases; and within 0.5"of 
the nucleus, on average Q > \. Thus, the nuclear region does appear to be too dynamically hot to form 
stars. Based on the star formation timescale we derived earlier, we suggest that cycle of heating and cooling 
probably occurs on relatively short timcscales of order 100 Myr. Thus time-dependent modelling may be 
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necessary if the geometry of the torus changes as the star formation or AGN fueUing pass through successive 
active and quiescent phases. 



6.3. Mass and Column Density of the Gas 

The final thread in our analysis of whether the molecular gas in the nucleus of NGC 3227 can be identified 
with the obscuring torus concerns its column density. Our data trace only the hot >1000K H2 through the 
l-OS(l) line, for which our flux of 1.1x10"^^ Wm~^ in a 0.8" aperture is consistent with that in Rodriguez- 
Ardila et al. (2004) and Quillen et al. (1999). Remarkably, it is possible to use this to make a reasonable 
estimate of the total molecular gas mass. 

That there exists a relation between molecular mass and 1-0 S(l) luminosity in actively star forming 
galaxies should not be surprising, given that relations are already known between 1-0 S(l) and infrared 
luminosity Ljr (Goldader et al. 1997), and between Ljr and CO luminosity Lqq which traces gas mass 
(Young & Scoville 1991). By comparing the 1-0 S(l) line luminosities of 17 luminous and ultraluminous 
galaxies experiencing intense star formation (some of which also host an AGN) to gas masses derived from 
millimetre CO 1-0 luminosities, Mueller Sanchez et al. (2006) found that 

Mgas[M0] ^ 4OOOLi_os(i)[i0] (mean) 

with a standard deviation of about a factor of 2. The exception to this was NGC 6240 which is known to be 
overluminous in 1-0 S(l) and for which the scaling factor is by far the lowest, about an order of magnitude less 
than the typical value. The most likely reason for this is that the line does not originate in star formation, 
but is due to cloud collisions resulting from the extreme turbulence in the gas (Sugai et al. 1997; Tacconi et 
al. 1999). This particular case therefore provides us with a way to estimate a conservative lower limit on the 
gas mass using 

Mgas[M0] ^ 43OLi_os(i)[i0] (NGC 6240). 

For NGC 3227 this relation implies a gas mass in the central 0.8" (65 pc) exceeding 4x10^Mq and perhaps 
as much as (2~8)xlO^M0. 

We now compare this range of estimates to three other constraints on the mass. In Section 4.5 we found 
that the mass-to-light ratio of the stars in this same nuclear region is M*/Lk = 0.5Mq/Lq, with a young 
stellar mass of 2.0 x lO"^ Mq. On the other hand, the Schwarzschild modelling in Section 5.1 implies that the 
most likely effective mass-to-light ratio including gas, is Mtotai/^K = 1 ^ HMq/Lq. And this result would 
imply a gas mass in the nuclear region of Mgas — 0.2 — 1.8 x 10^ Mq, and a most likely mass of about half 
this. 

The mass estimated by Schinnerer et al. (2000) is rather less, but did not include any correction to 
take into account the random motions inferred from the large velocity dispersion. We can make a rough 
estimate of the dynamical mass by including with the Kcplerian M = V^^r/G the contribution of these 
random motions so that M ex V^^^ + 3(j^ (where the factor 3 comes from a simple comparison of the kinetic 
energies associated with the two quantities). Taking from the CO (2-1) data in Fig. 18 that V^ot ~ 50kms~^ 
at 0.4" radius and a = eOkms^^, we find M - 1.2 x 10^ Mq. 

Finally, in Section 6.2 we argued that, if radiation pressure does support the vertical thickness of the 
gas distribution, then the gas fraction may be only /g ^ 0.5, implying equal masses of stars and gas. Thus 
Mgas could perhaps be as low as 2.0 x 10^ M0. 
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All the estimates above point to the same range of masses, and so we can conservatively claim that 
the gas mass is most likely 2-20 x 10^ M©. While there is an order of magnitude uncertainty, this still has 
important implications. Assuming the gas is uniformly distributed over the 0.8" aperture, the mean column 
density through the entire region would be nn = (0.8-8) xlO^^'cm^^. If the gas does represent the torus, 
then such a large column should not be unexpected since there are many examples of type 2 AGN for which 
hard X-ray observations indicate that the AGN itself is hidden behind similar amounts of gas - although 
it should be noted that the X-ray absorption may occur on very small scales. Two particularly well known 
cases are: NGC4945, for which the bright nuclear emission below ^lOkeV is completely blocked, implying 
an absorbing column of ~ 5 x lO^** cm^^ (lawasawa et al. 1993; Done et al. 1996); and Circinus, for which the 
absorbing column was estimated to be ^ 4 x 10^** cm~^ based on the direct detection of X-rays at energies 
greater than lOkeV (Matt et al. 1999). 

For a standard gas-to-dust ratio with rin [c?7i~^] — 1.5 x 10^^ ry, the column density implies an optical 
depth in the visual of ry > 500. Since we are implicitly assuming that the stars are mixed with the gas 
rather than hidden behind a foreground screen, we must again use the mixed extinction model which we have 
used previously. Hence we can compare directly the optical depth estimate here with the maximum possible 
to the stars of Ty = 26 from Section 4.5. The two estimates are totally inconsistent with each other. The 
solution to this contradiction lies in the gas, instead of having a uniform distribution, being concentrated 
into clumps. If the clumps are sufficiently compact then most lines of sight will not be intersected - and 
hence the stellar light will suffer little extinction. The degree of dumpiness need not be extreme: even local 
increases in density of a factor of a few would be sufficient to reduce the covering factor of the gas to half, 
consistent with the maximum stellar extinction. 

The gas density in these clumps will be high: the mean density over the whole region for a uniform 
distribution and assuming the gas is extended vertically as much as it is laterally (i.e. by ~60pc) is (4- 
40)xl0''cm~'^. This is already comparable to the typical densities in parsec-scale cores of molecular clouds. 
It is therefore no surprise that, as reported by Rodriguez- Ardila et al. (2004), the near infrared H2 line ratios 
indicate that the hot gas is thermalised. 

In summary, we have used several independent mass estimates to put limits on the gas mass within 
30 pc of the AGN. This mass implies very significant column densities of gas, easily sufficient to fulfil the 
third criterion of the obscuring torus mentioned in Section 6.1. Thus, we can conclude that the gas we have 
observed via the 1-0 S(l) is likely to be associated with the molecular torus. If so, then we have demonstrated 
that the gas in the torus is clumpy in nature, and that the torus supports episodes of active star formation. 



6.4. The Inner Edge of the Torus 

A curious result from the datacube is that the 2.1 and 2.3/im continuum centroids do not coincide. 
Since the stellar continuum increases to shorter wavelengths, while the non-stellar continuum (i.e. hot dust 
emission associated with the AGN) increases to longer wavelengths, this can be most easily explained if 
these two continua are offset from one another. However, the scales involved are similar to those expected 
for differential refraction between the two wavelengths, about 0.01", and this makes the two phenomena 
difficult to disentangle. 

Instead, we can consider the separation of the continuum into its stellar and non-stellar components via 
the CO bandheads longward of 2.3/im. Because the two components are derived at the same wavelength, 
they are not susceptible to differential refraction. Yet the offset clearly remains, as shown in Fig. 20: the 
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non-stellar continmim is shifted by about 1 pixel (0.0125" or 1 pc, approximately 1/8 of the PSF size) to the 
north-east. This effect is not due to a few pixels which happen by chance to be brighter or fainter, but is a 
systematic global shift of the entire feature; and is significant with respect to the 0.05 pixel formal errors in 
each axis from a Gaussian fit to the respective profiles. 

Our interpretation is that we are seeing directly the inner edge of the torus. We have shown this 
schematically with the cartoon in the right panel of Fig. 20, where for simplicity we have drawn the torus as 
a uniform ring with sharp edges; this representation should not be taken too literally. If the torus is oriented 
in the same way as the larger (i.e. >100pc) scales, it will be inclined with its major axis on a line from 
south-east to north-west. As a result, the inner edge will only be visible on the far side; the near side will 
be hidden by the outer layers, since the torus is presumably optically thick. Regardless of whether the torus 
is modelled as a uniform or clumpy medium, its inner boundary is always expected to be a radius of ^1 pc 
(e.g. Schartmann et al. 2005). This distance is set by the sublimation temperature of the dust grains, since 
they cannot exist any closer to the AGN. Depending on the thickness and inclination of the torus, one then 
would expect the centroid of the observable emission from the hottest dust grains (i.e. those at ^ 1000 K, 
heated by the AGN rather than by stars) to lie at a projected distance of ~lpc from the AGN itself. In 
contrast, since star formation occurs throughout the torus, the centroid of the stellar continuum is centered 
on the AGN. Hence one would expect to find, and indeed Fig. 20 shows, an offset between the stellar and 
hot dust continua. 



7. Conclusions 

We have presented new near infrared integral field spectroscopic data at 70kms^^ FWHM spectral 
resolution of the Seyfert 1 nucleus in NGC 3227, making use of adaptive optics to reach a spatial resolution 
of 0.085" (7pc). In this paper we have addressed the issues of star formation and molecular gas around the 
AGN and their relation to the canonical obscuring torus; and we have analysed the stellar kinematics using 
Schwarzschild models to determine the black hole mass. Our main conclusions are: 

• The nuclear star forming region around the Seyfert 1 nucleus is spatially resolved, on scales of a few 
parsecs to a few tens of parsecs. The most recent episode of intense star formation began ~40Myr ago 
but has now ceased. Within 30 pc of the AGN this starburst still accounts for 20-60% of the galaxy's 
bolometric luminosity. Despite showing evidence for moderate rotation, the stars' kinetic energy is 
dominated by random motions indicating that they lie in a thick disk. 

• Schwarzschild modelling of the stellar kinematics lead to a black hole mass in the range Mbh = 
7 X 10^ — 2 X 10^ Mq. The upper end is consistent with (although still less than) previous estimates 
made using other techniques. The large range arises through a degeneracy in whether mass is attributed 
to the black hole or the stars and gas, which can be resolved with better kinematic line profiles. 

• The gas in the central 80 pc of NGC 3227, exhibits several critical properties that are expected of a 
molecular obscuring torus: its spatial extent is a few tens of parsecs, it is geometrically thick, and it 
has column density of order nn = lO^'^-lO^^ cm~^. This argues that the gas we have observed is the 
torus. Moreover, based on the similarity of their spatial extents and their kinematics, it is likely that 
the gas and stars are physically mixed. Thus the torus also supports episodes of active star formation. 

• It seems unlikely that the current level of AGN or star forming activity can inject sufficient energy 
into the ISM to maintain the vertical thickness of the torus. However, this was possible when the star 
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formation rate was at its peak value. We speculate that the torus may heat and subsequently cool, 
changing its vertical profile, as the star formation and AGN go through active and quiescent phases. 



The authors are grateful to the staff at the Paranal Observatory for their support during the observations, 
and for the entire SINFONI team at MPE and ESO. They are also indebted to Carole Mundell for kindly 
finding and re-reducing the radio continuum data. Finally, they thank the referee for comments which helped 
improve the manuscript. 
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Tabic 1. Measured fluxes and flux densities for NGC3227 



Item 


Measurement*^ 




unit 


H2 1-0 S(l) 


10.9*^ 


10- 


-i8Wm-2 


narrow Br7 


4.6b,c 


10- 


-i8Wm-2 


broad Br7 


42^^ 


10- 


-i8Wm-2 


[Caviii] 


1.5b 


10- 


-i8Wm-2 


total K-band continuum 


32 


10- 


-i5Wm-Vm"^ 


Wco 


3.6'^ 


A 




young stellar K-band continuum 


7.0'' 


10- 


-i5Wm-Vm"^ 


old stellar K-band continuum 


4.5'' 


10- 


-i5Wm-Vm"^ 



** All measurements are given for a 0.8" aperture centered on the contin- 
uum peak. 

'^Uncertainties are approximately 0.3 x 10^^* Wm~^ (3 times larger for 
the broad Br7), and are dominated by calibration and from the stellar 
continuum features. 

"^ We estimate that 25% originates in the narrow line region and 75% 
from star formation, yielding the number of ionising photons from young 
stars to be logQLyc — 51.93. 

'^ The dilution at 2.3/xm implied by Wco is greater than the mean dilution 
over the K-band which is centered at 2.18/xm. 

° Assuming a dilution over the K-band of 2.8; the uncertainty in this 
corresponds to no more than 20% uncertainty in stellar the continuum flux 
density. The division between young and old populations is based on the 
bulge and excess continua as shown in Fig. 6. 
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Table 2. Derived properties for the central 0.8" of NGC 3227 



Item 


Measurement^ 


unit 


logi^/iyQ (stellar) 


7.8 




Mgas 


2-20 X 10^ 


M0 


young stellar component: 






logiif/i© 


7.6 




logibol/i© 


9.5 




initial star formation rate 


3 


Mq yr"^ 


current star formation rate 


0.05 


M0yr-i 


age 


40 


Myr 


e-folding decay time 


10 


Myr 


MJLk 


0.5 


Mq/Lq 


M, 


2x 10^ 


Mq 
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Fig. 1. — Azimuthally averaged profile of the broad Br7 line map, used to estimate the spatial resolution. 
The data points (values for every pixel within 0.3" of the centre) are marked as grey crosses and show no 
asymmetric artifacts which could result from the AO correction. Overdrawn is a Gaussian profile (dashed 
line) and a Moffat profile (solid line). Although the latter better reproduces the wings in the profile, both 
appear to be reasonable apprxoiniations to the profile and both yield the same FWHm resolution of 85 mas. 
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Fig. 2. — Left: the HST F160W (square-root scaling) image after subtracting the bulge-plus-disk model 
derived from the 2MASS image and the HST image at r > 1.6". The field of view of SINFONI is shown as a 
box in the centre. The outline of the stellar ring is drawn as a dashed ellipse, with axis ratio 0.6 at position 
angle —30°. Right: CO (2-1) molecular gas map from Schinnerer et al. (2000), with contours at 40, 60, and 
80% of the peak. The same box and ellipse as at left are also marked on this image. North is up and east is 
left. 
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Fig. 3. — Images from the SINFONI data cube showing the primary continuum and emission hne morpholo- 
gies in the central arcsec of NGC 3227. In each case, the colour scale stretches from 0-100% of the peak flux, 
and contours are spaced equally between 15% and 90% of the peak flux. A crossed circle indicates in each 
panel the peak of the continuum emission. The maps show, from left to right and top to bottom: 2.1/im 
continuum, stellar continuum (derived from the stellar absorption), H2 l-OS(l), narrow Br7, and [Caviii]. 
North is up and east is left. 
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Fig. 4. — Measured kinematics for the gas and stars in the central arcsec of NGC3227. Top: H2 1-0 S(l) 
line; middle: narrow component of the Br7 line; bottom: stars. Left: velocity field, determined as described 
in Section 2; Centre: the same velocity field, masked to include only those pixels with higher fluxes - in 
each case, these pixels make up 2/3 of the total flux; Right: dispersion. The maps for emission lines are at 
a resolution of 0.085"; those for the stars have had additional smoothing, and are at an effective resolution 
of 0.115". Note that in the line dispersion maps, the data have not been corrected for the instrumental 
resolution, which corresponds to ct = 30kms~^. 
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Fig. 5. — Equivalent width of the 2.294 /im CO 2-0 bandhead as a function of age, calculated using STARS 
(which includes the thermally pulsing asymptotic giant branch stars). Several different star formation his- 
tories are shown: instantaneous, a decay time of 10 Myr, a decay time of 100 Myr, and continuous. For each 
history, data are plotted where the K-band luminosity is at least 1/15 of its maximum. This indicates that 
one expects Wco '^ 12 A for nearly all star forming scenarios. 
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Fig. 6. — Luminosity profile (top), ellipticity (centre), and position angle (bottom) as functions of radius 
for NGC 3227. Data from different sources are denoted by: blue circles for 2MASS H-band, green squares 
for HST F160W, red triangles for SINFONI stellar K-band continuum, yellow triangles for SINFONI total 
K-band continuum. In the top panel, the relative scaling has been adjusted according to the overlapping 
regions. The dotted line indicates the large scale disk; the solid line the bulge plus disk. A clear emission 
excess is apparent in the central arcsec, associated with changes in ellipticity and position angle. 
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Fig. 7. — Radial profile of the nuclear stellar light. Filled triangles denote HST F160W data, and open 
squares the SINFONI K-band stellar continuum. The grey dotted line indicates the spatial resolution by 
tracing a Guassian with 0.085" FWHM. The grey dashed line represents the r^/^ bulge fit at 2-10" radii 
extrapolated inwards. The dark solid line indicates a fit to the data including the bulge component and an 
exponential whose scale length changes at a radius of 0.11". 
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Fig. 8. — Plots of WBr-y (left) and I'SN/LK (right) as a function of age for four star formation histories. 
The grey bands indicate the range of the measured ratios, and in both cases only the stellar continuum has 
been considered. The only scenario which yields a consistent age for both diagnostics is star formation over 
a timescale of ^10 Myr which occured ~50Myr previously. 




RA offset (arcsec) 

Fig. 9. — Contour plots of the 6 cm radio continuum kindly provided by C. Mundell. Left panel: at low 
resolution (u,v-tapered image with beam size 0.191" x 0.141"), demonstrating that the 2 blobs seen at 18cm 
(Mundell et al. 1995) break up into discrete sources at higher resolution. Right 3 panels: at high resolution 
(naturally weighted image with beam size 0.076" x 0.053"). Contour levels are at 2, 3.5, and 5 times the 
rms noise of 0.37 mJy beam" ^. Images on which the contours are superimposed are from Fig. 3. It is not 
possible to align the data astrometrically. Instead, we have assumed that the brightest 6 cm peak coincides 
with the peak in the K-band stellar continuum - justifiable under the assumption that the radio continuum 
is due to star formation. This alignment is not critical to the interpretation. North is up and east is left. 
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Fig. 10. — Spectra extracted from the SINFONI data cube in 0.25" apertures centered on the nucleus (red), 
0.25" to the north-east (blue), and 0.25" to the south-west (green). All spectra are normalised at 2.25/im. 
The effect of dilution on the CO bandheads is clearly seen in the nuclear spectrum. 
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Fig. 11. — Left: stellar rotation curve (found using the commonly employed tilted ring model put forward 
by Begeman 1989); Right: azimuthally averaged radial profile of the stellar dispersion. In both cases a fixed 
inclination of 55° at position angle 135° was adopted. The velocity increases steadily to '^80kms~^ at 0.4". 
The dispersion of 160-170 km s~^ drops quickly to 140kms~^ inside a radius of 0.2" - perhaps because of 
the increasing predominance of the young stars; and then begins to increases slowly at the smallest radii - 
perhaps due to the dynamical impact of the black hole. 
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Fig. 12. — Radial and angular binning scheme used as input for the Schwarzschild orbit superposition 
modelling. The same arrangement was repeated for each of the four quadrants. The total number of bins 
(32 X 4) is well matched to the number of indepedent data points, given the spatial resolution of 0.085" and 
the 0.93" field of view. 
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Fig. 13. — Goodness of fit Ax^ 



X 



min(x^) versus black-hole mass (allowing the mass-to-light ratios 



of both the nuclear and bulge components to be optimised in the model each time, irrespective of external 
constraints): i — 75° (red solid) and i — 60° (blue solid). The blue dashed line shows the case for rescaled 
error bars in h^ and h^ (see text for details). 
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Fig. 14. — The range of models calculated (black points) and inferred confidence intervals (contours, in the 
range l-Scr corresponding to Ax^ = {1,4,9,16,25}) for the Schwarzschild models calculated for the best 
fitting inclination of 60°, with the third parameter in each case fixed at its optimal value. Left: as a function 
of Mbh and Mass/Light ratio of nuclear component; Centre: as a function of Mbh and Mass/Light ratio of 
bulge. Right: as a function of Mass/Light ratios of the bulge and nuclear component. 
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Fig. 15. — Best-fit stellar mass density and its decomposition into the bulge and nuclear component, respec- 
tively. 
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Fig. 16. — Comparison of model and data velocity (upper) and dispersion (lower). The 4 panels from left to 
right represent the 4 angular bins (from major axis at left to minor axis at right), the mid-points of which 
are indicated at top. 
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Fig. 17. — Absolute difference between the H2 1-0 S(l) and stellar velocity fields, with contours drawn at 
intervals of 25kms^^. The only major difference between the two is to the north-east, out along the minor 
axis. 
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Fig. 18.— Gas kinematics from the CO (2-1) data of Schinnerer et al. (2000) and our SINFONI H2 1-0 S(l) 
data (filled and empty points respectively). Both are extracted along their kinematic major axis, with the 
two sides averaged. Top: rotation velocity Vrot (triangles) and dispersion a (squares) Bottom: the local 
V^ot/c ratio increases smoothly with radius. The differences between the two data sets are primarily due to 
the beam sizes (0.6" and 0.085" respectively). 
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Fig. 19. — Calculation of the Toomrc Q parameter, based on the kinematics in Fig. 18 for the CO (2-1) 
data and our SINFONI 1-0 S(l) data. For the former, the mass surface density S is estimated under the 
assumption of Keplerian rotation in a thin disk; for the 1-0 S(l) data we have considered the two extreme 
cases of S being either constant or following the centrally concentrated stellar distribution. 
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Fig. 20. — Left and Centre panels: images from the SINFONI data cube showing the steUar and non-stellar 
continua, both at 2.3/im. The large cross on both panels indicates the centre of the stellar continuum. The 
offset of the non-stellar continuum to the north east is easily seen. Right panel: a cartoon of the canonical 
torus showing what might cause this offset. North is up and east is left. 



